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Abstract. We provide some statistics about an irreducibil- 
ity/reducibility test for multivariate polynomials over finite fields 
based on counting points. The test works best for polynomials in 
a large number of variables and can also be applied to black box 
polynomials. 



1. Introduction 

Let / € FJ polynomial. Since f(x) can take only q possible 

values for every point in x G A n (F g ) we expect that f(x) = for about 
- of the points A n (¥ q ). If on the other hand / = gh is a product of two 
polynomials g, h £ ¥ q [xi, . . . , x n ], we have f(x) = if g(x) = or h(x) = 0. 
So one might expect that products of polynomials satisfy f(x) = g{x)h{x) = 
for approximately | — of the points x £ A n (F g ). This phenomenon is 
well explained by the Weil formulas JVlil80 . 

In this article we investigate the following irreducibility test for multivariate 
polynomials / over ¥ q : 

Evaluate / at N random points. We reject the hypothesis that / is reducible, 
if the fraction of zeros J q (f) found is significantly smaller than | — \. Note 
that 99.5% of all polynomial functions satisfy 

7 9 (/) < ~ + 2-58 

This irreducibility test is quick, since the number of evaluations needed to 
detect a given percentage 1 — e of all products of polynomial functions or all 
general polynomial functions do not depend on the degree of the polynomials 
considered respectively, i.e. 

N ~ O(-glne). 

On the other hand it is dirty, since it does not give a definite answer. More- 
over we cannot make e arbitrarily small, because ./V is bounded by q n , the 
number of ¥ q rational points in A n (¥ q ). There will always be a few polyno- 
mials that cannot be correctly classified by our method at all. For example 
the product of an irreducible, absolutely reducible polynomial with a further 
absolutely irreducible polynomial. 




Date: February 8, 2008. 



1 



2 



H.-C. GRAF V. BOTHMER AND F.-O. SCHREYER 



The test works for implicitly given (black box) polynomials as well. We give 
examples of such polynomials below. 

The expected fraction of zeros for special classes of polynomials is also larger 
than - . For example, the expected fraction of zeros for n x n determinants 
is 

£(7 9) det) = l/q + l/q 2 ~ V? 5 " 1/q 7 + 0(1/ q 12 ) 

for n > 12. 
Notation 1.1. 

Fq the finite field with q elements 

X C A n an affine algebraic set 

X(¥ q ) the Fg-rational points of X 

\X\ = \X(¥g)\ the number of F g -rational points of X 

j q (X) the fraction of Fg-rational points in A n 

that are contained in X 
B(N,p, k) = — p) N ~ k the binomial distribution 

N the number of trials 

p the success probability 

k the number of successes 

N(fi, cr) the normal distribution with mean \i 

and variance a 2 



B(N,p) can be approximated by J\f(p, \/p(l — p)/N). 

2. Fractions of Zeros 

Example 2.1. We choose fixed polynomials /i, /2 of degree 5 and fa of de- 
gree 10 in Z[x%, . . . , 24] with coefficients in [—9, 9] using the random number 
generator of the computer algebra system MACAULAY 2 GS and consider 
/ = /1/2 + 7/3. Let X be the vanishing set V(f). 

A black-box polynomial is a polynomial for which it is easy to check f(x) = 
0, but the explicit formula for / in terms of the unknowns hard 
or impossible to write down. 

Example 2.2. Let S d C H° (P 2 , (d) ) be the hypersurface of singular 
homogeneous polynomials / of degree d in 3 variables. For each point 
/ G H (F 2 ,O(d)) it is easy to decide whether / £ Sd via the Jacobi cri- 
terion. On the other hand the equation of Sd in the ( d ^ 2 ) variables is not 
obvious. 

Example 2.3. Let C C P 4 be the determinantal curve of degree 10 and 
genus 6 defined by the maximal minors of the following 5x3 matrix 

/ X + Xi - X 3 - X4 Xo — X\ — X2 — X4 —x + X 3 + X4 \ 

—x — X 2 + X3 + X4 x — X\ — x 2 — X 3 + X4 —x + Xi — X 2 + X 3 + X4 

~Xq — X2 — X3 — X4 ~Xq — X\ — Xz — X4 —X\ + X4 

—Xl - X 2 - X 3 + X4 —Xl - X2 —xi + x 2 

\~X + Xl — X 2 — X 3 — X4 ~x + x 2 — Xz + X4 x — Xl + X2 + X 3 + X4 J 

Let D = {H G P 4 I H n C is singular} be the dual variety of C . 
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Definition 2.4. Let X C A n an algebraic set. We denote by 

\X(F„)\ 



|A»(F,)| 

the fraction of ¥ q -rational points on X. In particular for a hypersurface 
X = V(f) we have j q (f) = 7,(V(/)). We call 7 ,(/) the fraction of F,- 
rational zeros of f. 

Example 2.5. We estimate 7 g in three of our examples by evaluating in 
N = 1000 random points over all primes up to 17. The following table gives 
the 99% confidence interval for ^ q : 



Q 


X 


s 8 


D 


2 


56.7% ± 4.0% 


68.4% ± 2.9% 


55.3% ±4.1% 


3 


33.8% ± 3.9% 


42.3% ±3.1% 


49.2% ±4.1% 


5 


17.9% ±3.1% 


24.0% ± 2.6% 


24.9% ±3.5% 


7 


26.2% ± 3.6% 


16.8% ± 2.3% 


35.3% ± 3.9% 


11 


9.3% ± 2.4% 


8.9% ± 1.8% 


8.0% ± 2.2% 


13 


8.6% ± 2.3% 


9.6% ± 1.8% 


8.4% ± 2.3% 


17 


5.2% ± 1.8% 


8.1% ± 1.7% 


5.9% ±1.9% 



In this article we will explain these numbers. 

Remark 2.6. We can compute the true values 72 (-X") = 56.3%, 73 (-X") = 
34.6%, 75 (X) = 18.7% and 77(A) = 27.6% with the same effort, since there 
are less than 1000 rational points in A 4 (F 9 ) for q < 7. 

To study the map 

j q : F ? [ari...x n ] -> [0,1], / ^ lq (f) 
we note that 7 9 (/) factors over the ring R := map(A n (F 9 ), F q ): 




Lemma 2.7. ip is surjective. 

Proof. Since |A n (F,j)| = q n < 00 we can find a polynomial with prescribed 



values at these points via interpolation. 



□ 



We study the distribution of 7^ on R by regarding it as a random variable 
on the finite probability space 

(R,n,p) 

with Q the sigma algebra of all subsets of R and P the constant probability 
measure. 

Proposition 2.8. The distribution of j q on R is binomial 

*(*-£)-"(«'••;■* 

In particular the expectation value of ^ q is E(j q ) = ^. 
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Proof. We have to count the maps / E R that map precisely k different 
points to 0. Since the values at different points are independent, this number 
is 

<Ql*.(g-l)« B -* 

k 



The probability that 7 9 = is therefore 



□ 



Example 2.9. Consider maps f & R = map (A 4 (Fn),Fn). The distribu- 
tion of fractions of zeros is 

p ( 7ll = fc/n 4 ) =B(n\i/n,k) . 

From its approximation by the normal distribution AA(0.0909, 0.0024) we 
obtain 

P(0.0847 < 7ii < 0.0971) < 99%. 

We now consider products. The random variable 

7giU : Kx R^ [0,1], l q M9) = 1 q tfg) = \V(f)UV(g))\/q n 
which assigns to each pair of functions the fraction of zeros of their product. 
Proposition 2.10. On Rx R the distribution 0/7^ is 
P( lq , u = k/q n )=B(q n ,(2q-l)/q 2 ,k). 
In particular the expectation value of 7 gjU is 

Proof. The value of / • g in a point x depends on the values of / and g 
at x. There are q 2 ways of choosing these values of which (q — l) 2 give 
(/•<?)(*) 7^0. □ 

Example 2.11. Consider pairs (/, g) of functions in R as in Example 12.91 
The distribution of 711 u is now 

P (7ii,u = A:/ll 4 ) =£(ll 4 ,21/ll 2 ,fc). 

From its approximation by the normal distribution jV(0. 1736, 0.0031), we 
obtain 

P(0.1655 < 7ii,u < 0.1816) > 
Note that this range does not intersect 

P(0.0847 < 711 < 0.0971) > 



Geometrically products of functions correspond to the union of their zero- 
sets. 7q also behaves well under other geometric operations: 
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Figure 1. 99% of polynomial functions on A have 7 ? be- 
tween the continuous lines. 99% of products have "f q between 
the dashed lines. 

Proposition 2.12 (Intersection). Let X C A™ be a subvariety. We consider 
the random variable 

7 9 ,nx : R - [0, 1], J q ,nx(f) = \V(f) D X\/q n . 

The distribution of 7 g ,nx is 

P(lq,nx = k/q n ) = B(\X\, l/q,k). 

In particular, the expectation value of ^ q ^x is E{^ q ,r\x) = lq{X)/q, where 
r y q (X) = \X\/q n is the fraction of points on X in A n (¥ q ). 

Proof. Clearly, x G X n V(f) if and only if x G X and f(x) = 0. Since the 
values of / can be chosen independently on the points of X, we have 



P(x G keifr\X\x G X) 



1 



□ 



Corollary 2.13. Consider the random variable 

7g , n : R c - [0, 1], 7 ,,n(/l, ■ ■ ■ , h) = \V(h) (1 
Then the expected fraction of points is E{^ q ^) = ^ 

Proof. Use Proposition 12. 121 inductively. 



nn/ c )i/? n 



□ 



Notice that for polynomials fi, ■ ■ ■ , f c the expected codimension of 
F(/ 1) ...,/ c )cA n isalsoc. 
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Singular curves in P 2 
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Figure 2. The diagram shows the expectation values for 
various classes of polynomials in a large number of variables, 
and the measurement for Ss, the hyper surf ace of singular 
plane curves of degree 8. Note that the diagram tells that 
about 70 % of all plane curves over F2 are singular. 



Proposition 2.14 (Substitution). Let R m = map ( A n (F 9 ) , A m (F ? ) ) and 

X C A m (¥ q ) a subset. Consider the random variable 

7 q ,Subsf- R m -» [0,1], l q ,subst{4>) = I^^IA?" 

The distribution of j q S ubst ^ s 

P (l q ,subst = k/q n ) = B (q n n q (X), k) . 

In particular the expectation value of ^qsubst is E(-y q su bst) = l q {X) = 
\X\/q n . 

Proof. Choosing functions /1, . . . , f n is equivalent to independent choice of 
the image points. Therefore the probability of (j)~ 1 (X) containing exactly k 
points is the same as the probability of hitting k points of X while choosing 
q n points in F™. This gives the desired binomial distribution. □ 



3. Determinantal Varieties 

Even though we have shown, that E{^ q ) = - with a small variance on the 
set of all functions from A to ¥ q , there are special classes of functions that 
have larger expected j q . 

It turns out that this behavior is common for determinants: 
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Proposition 3.1. Let X C A nm be the determinantal variety of n x m 
matrices with n < m of rank less than n. Then the fraction of points on X 
is 

n-l 



7,w=i-n(i--^) 

i=0 V H ; 



i.e. X contains Jq(X) • q nm points. 



Proof. We prove that the number of matrices that have maximal rank is 

n-l 



n ^ - 



i=0 

by induction. M is a matrix of full rank if and only if the first n — 1 rows 
form a matrix of full rank and the last row is linearly independent of the 
first n — 1 rows. Since there are q n ~ l linear combinations of the first n — 1 
rows we obtain a further factor (q m — q""^ 1 ). 

□ 

Corollary 3.2. On the space of matrices R nm , consider the random variable 

l g ,det- R nm -» [0, 1], 7,,det(M) = \{x £ A n | rankM( 3 ;) < n}\/q n . 
Then the fraction of zeros has expectation value 

n—l 

I 1 \ 1 

+ ... 



£(7 9 ,det) = i - n y- - -^i) = ^ 



m— n+1 



i=0 

T/ie distribution of jq^det is 

P (l q , dc t = k/q n ) = B(q\E( lqAct ),k) 



Proof. Substitute functions for the variables in the generic n x m matrix 
and use Proposition 12.141 □ 



In the special case ofnxn square matrices we have 

£(7 3 ,det) = l/q + 1/q 2 ~ 1/q 5 ~ l/q 7 + 0(l/q 12 ) 

for n > 12. 

Example 3.3 (Example 12.21 continued). For small primes the divisor 
has more points than expected for irreducible polynomials, but not enough 
to seem reducible, see Figure |2j Our measurements are consistent with 
the well known fact that Sd is an irreducible determinantal hypersurface 
|(1K/941 Chapter 13, Prop. 1.6 and 1.7]. 
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Figure 3. C has a simple node over F7 and is smooth over 
F p for p = 5, 11, 13, 17. 

4. Testing 

To decide between two binomial distributions with success probabilities pi < 
P2 and N experiments, we compute empirical probability p = jj and decide 
for pi if 



P < Vmiddle = VP1P2 



a/pi(1 -P2) + VMi-pi) 



VP1P2- 



^pi(i-pi) + vmi -P2) 

To achieve a confidence level of 1 — e we choose s = s(e) such that 

1 



and N such that 



*00 



N > s(e) 



00 ^2 

e 2"dx = e 



y/p^l - pi) + ^2(1 ~ P2) 

P2 ~Pl 



In our case we have 



1 r-a- 1 ) 



for 1 — e of all polynomials and 

2q-l 

P2 > 5— ■ 



'2g 7 l/ 1 _ 2g-l- 



for 1 — e of all products of polynomials. The decision based on the empirical 
probability p = is then correct in 1 — e cases of the experiments. Note 
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however, that for fixed n and q we cannot make e arbitrarily small, since we 
need p\ < P2- 

An easy calculation gives the following estimate 

VN>s(e) 2^ 

q — 1 — 2sq 2 

for q > 3, which approaches s(e)y/2q for large n or q. Since s(e) = 
0(y/— hi(e)), we conclude that N grows like 0(—qlne). 

For e = 0.5%, s = 2.58, the number of trials needed is 





2 


3 


5 


7 


11 


13 


17 


n 


= 1 


oo 


oo 


oo 


oo 


oo 


oo 


oo 


n 


= 2 


CO 


oo 


oo 


oo 


oo 


oo 


oo 


n 


= 3 


oo 


oo 


oo 


28373 


2355 


1908 


1669 


n 


= 4 


oo 


oo 


1103 


647 


634 


682 


803 


n 


= 5 


oo 


1705 


367 


369 


482 


551 


695 


n 


= 6 


oo 


384 


259 


308 


447 


521 


673 


n 


= 7 


4457 


224 


225 


289 


437 


513 


667 


n 


= 8 


619 


173 


212 


283 


434 


511 


666 


n 


= 9 


295 


151 


206 


280 


433 


511 


666 


n 


= 10 


197 


140 


204 


279 


433 


511 


665 



oo indicates that there are not enough points in A n (F g ) to perform the test 
for the required e = 0.5%. In case we can perform the test, the deciding 
number of successes Np m i^ e is 





2 


3 


5 


7 


11 


13 


17 


n 


= 1 


oo 


oo 


oo 


oo 


oo 


oo 


oo 


n 


= 2 


oo 


oo 


oo 


oo 


oo 


oo 


oo 


n 


= 3 


oo 


oo 


oo 


5607 


301 


207 


139 


n 


= 4 


oo 


oo 


303 


128 


81 


74 


66 


n 


= 5 


oo 


754 


101 


73 


61 


59 


57 


n 


= 6 


oo 


170 


71 


61 


57 


56 


55 


n 


= 7 


2821 


99 


61 


57 


55 


55 


55 


n 


= 8 


391 


76 


58 


56 


55 


55 


55 


n 


= 9 


186 


67 


56 


55 


55 


55 


55 


n 


= 10 


125 


62 


56 


55 


55 


55 


55 



5. Higher codimension 

In principle this method can be applied to algebraic sets of higher codimen- 
sion. 

Consider two surfaces in P 4 and their union. We would like to distinguish 
their union form the irreducible examples. One possibility is to consider the 
Chow form which is a determinantal hypersurface on G(2, 5) in this case. 
In Figure 0] we indicate the 5% and the 95% quantiles of 7 g for the Chow 
forms of 100 Bordiga surfaces, elliptic scrolls and their unions. A second 
possibility is to count points and apply Corollary 13.21 As Figure |1] shows 
there is no difference between the two methods. The formula for the error 
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Surfaces in P 4 
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Figure 4. The 5% and the 95% quantiles of 7 g for the Chow 
forms of 100 Bordiga surfaces, elliptic scrolls and their unions 
compared with the error estimates for counting points on 
codimension 2 determinantal varieties rescaled. Using the 
geometry of Bordiga surfaces we obtain a better estimate. 



term underestimates the number of points on a elliptic scroll, because the 
scroll is irregular. 

The method of searching points at random in higher codimensional subsets 
of rational varieties helped us in proving the existence of several interesting 
components of Hilbert schemes. |Sch96| . [BTiEj . |vBEL04j 
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